Pattern locator: a new tool for finding local sequence patterns in genomic DNA sequences
Abstract
We present a new tool for finding local sequence patterns in long DNA sequences. The program, Pattern Locator, uses an intuitive syntax for pattern description and provides more flexibility than existing programs by allowing combinations of specific nucleotide sequences, direct and inverted repeats, variable-length tandem repeats of subpatterns, and a specified number of errors in any part of the pattern.
Availability: The program is available for download and as a web service accessible through a CGI interface at http://www.cmbl.uga.edu/software.html. The source code is written in C and distributed under the GNU General Public License.
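To make the kinds of pattern elements listed in the abstract more concrete, the following is a minimal Python sketch of two of them: a fixed motif matched with a bounded number of substitution errors, and an inverted repeat with a variable-length spacer. It is a naive scan written purely for illustration. It does not reproduce Pattern Locator's pattern syntax, its C implementation, or its treatment of errors (which the abstract does not specify), and all function and parameter names are hypothetical.

```python
# Illustrative sketch only: this is NOT Pattern Locator's syntax or algorithm.
# All names below are hypothetical; errors are modeled as substitutions only.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def hamming(a, b):
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_motif(seq, motif, max_errors=0):
    """Return 0-based starts where motif occurs with at most max_errors substitutions."""
    m = len(motif)
    return [i for i in range(len(seq) - m + 1)
            if hamming(seq[i:i + m], motif) <= max_errors]


def find_inverted_repeats(seq, arm_len, min_gap, max_gap, max_errors=0):
    """Return (start, gap) pairs where an arm of arm_len bases is followed,
    after a spacer of min_gap..max_gap bases, by its reverse complement
    (allowing at most max_errors substitutions in the second arm)."""
    hits = []
    for i in range(len(seq) - 2 * arm_len - min_gap + 1):
        target = reverse_complement(seq[i:i + arm_len])
        for gap in range(min_gap, max_gap + 1):
            j = i + arm_len + gap
            right = seq[j:j + arm_len]
            if len(right) < arm_len:
                break  # second arm would run past the end of the sequence
            if hamming(right, target) <= max_errors:
                hits.append((i, gap))
    return hits
```

For example, `find_motif(genome, "GAATTC", max_errors=1)` would list every position where the hexamer occurs with at most one substitution, and `find_inverted_repeats(genome, 4, 3, 5)` would list candidate stem-loop sites with 4-base arms and a 3 to 5 base spacer. Both scans are brute force; a production tool working on long genomic sequences would need a more efficient search.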
Similar resources
High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
High fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discovering all patterns whose utility meets a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
Simple Sequence Repeats Amplification: a Tool to Survey the Genetic Background of Olive Oils
A reliable DNA extraction method for use on extra virgin olive oil based on a commercial kit was defined, and the possibility of using this DNA for fingerprinting the original cultivar was demonstrated. The genetic traceability of single-cultivar virgin olive oil from two cultivars (Carolea and Frantoio) was achieved by identifying the varieties from which they were produced. This involved the ...
Complete Genomic Sequence of a Strain of Tomato Yellow Leaf Curl Virus from Iran
Background and Aims: Tomato yellow leaf curl virus (TYLCV) is one of the most destructive viruses of tomato and can reduce tomato yield by up to 100% in tropical and subtropical regions. In this study, the complete sequence of a TYLCV isolate from Hormozgan province, Iran, and its recombination event were determined. Methods: TYLCV infected tomato was collected from Hormozgan province. Total D...
Duplex destabilization in superhelical DNA is predicted to occur at specific transcriptional regulatory regions.
Analytic methods that accurately calculate the extent of duplex destabilization induced in each base-pair of a DNA molecule by superhelical stresses are used to analyze several genomic DNA sequences. Sites predicted to be susceptible to stress-induced duplex destabilization (SIDD) are found to be closely associated with specific transcriptional regulatory regions. Operators within the promoters...
Detection of Protein Coding Sequences Using a Mixture Model for Local Protein Amino Acid Sequence
Locating protein coding regions in genomic DNA is a critical step in accessing the information generated by large scale sequencing projects. Current methods for gene detection depend on statistical measures of content differences between coding and noncoding DNA in addition to the recognition of promoters, splice sites, and other regulatory sites. Here we explore the potential value of recurren...
Journal title:
- Bioinformatics
Volume 22, Issue 24
Pages -
Publication date: 2006